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Abstract 

We consider a cascade model of N different processors performing a distributed 
parallel simulation. The main goal of the study is to show that the long-time dynamics 
of the system has a cluster behavior. To attack this problem we combine two methods: 
stochastic comparison and Foster-Lyapunov functions. 

1 Introduction 

The present paper contains a probabilistic analysis of some mathematical model of asyn- 
chronous algorithm for parallel simulation. For the detailed discussion of synchronization 
issues in parallel and distributed algorithms we refer to [HE]. Here we give only a brief 
description of the problem. In large-scale parallel computation it is necessary to coordi- 
nate the activities of different processor which are working together on some common task. 
Usually such coordination is implemented by using a so-called message-passing system. 
This means that a processor shares data with other processors by sending timestamped 
messages. Between sending or receiving the messages the processors work independently. 
It can be happened that till the moment of receiving of a message some processor can pro- 
ceed farther in performing its program than the value of timestamp indicated in this newly 
received message; in this case the state of the processor should be rolled back to the indi- 
cated value. It is clear that due to possible rollbacks the mean speed of a given processor 
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in the computing network will be lower than its proper speed. One of the most important 
performance characteristics of the system is the progress of the computing network on large 
time intervals. 

Probabilistic models for such system are studied already for twenty years. From the 
probabilistic point of view these models consists of many relatively independent compo- 
nents which synchronize from time to time their states according to some special algorithm. 
The detailed review of all existing publications is out of range of this paper. We would 
like to mention only that the bibliography on this subject consists mostly of two group 
of papers. The first group of publication El El EH El ESI are devoted to the case of 
two processors. The paper [2] is of special interest since it contains an exhaustive basic 
analysis of the two-dimensional model and had a big influence on further research. The 
case of many processors was studied in [SI IH El El El EU EES]- An important difference of 
such models from two-dimensional case is that in a realistic model with more than two 
processors one message can provoke a multiple rollback of a chain of processors. Since 
the multi-dimensional model is much more complicated for a rigorous study, in the above 
papers the authors deal with the set of identical processors and their mathematical results 
are contained in preparatory sections before large numerical simulations. 

It should be noted also that probabilistic models with synchronization mechanism are 
interesting also for modelling database systems (see for example, [I]). Moreover, now 
synchronization-like interactions are considered as well in the framework of interaction 
particle systems [TB I IT7 1 fTH]. 

The model considered in the present paper is of special interest for the following rea- 
sons. We deals with a nonhomogeneus model consisting of several different processors. 
We consider case of message-passing topology other from the topology of complete graph 
which was considered in all previous papers. Our main interest is the cascade model which 
pressupose a subordination between processors. We put forward a conjecture on the cluster 
behavior of the system: processors can be divided into separated groups which are asymp- 
totically independent and have their own proper performance characteristics. Our main 
goal is to justify this conjecture. One should point out that in the case of complete graph 
topology the cluster decomposition into groups is degenerated and, thus, not interesting. 

We describe our model in terms of multi-dimensional continuous time Markov process. 
To get asymptotical performance characteristics of the model we combine two probabilistic 
methods (stochastic comparison and Foster-Lyapunov functions). 

The paper is organized as follows. In Section [2] we introduce a general continuous 
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time Markov model and define a cascade model as a special subclass of the general model. 
In Section 03 we pass to the embedded Markov chain. Main problem is now to study a 
long-time behavior of Markov chain with highly nonhomogeneous transition probabilities. 
To do this we consider relative coordinates and find groups of processors whose evolution 
is ergodic (convergences to a steady state) in these relative coordinates. To our opinion 
the method of Foster-Lyapunov functions seems to be the only one to prove the stability 
in relative coordinates for the Markov chain under consideration. First of all in Sectional 
we start from the case of two processors (N = 2) and the analysis here is rather simple 
and similar to [2|. In the study of the three-dimensional case (Section EJ) the main point 
is the proof of ergodicity. We propose an explicit construction of some nonsmooth Foster- 
Lyapunov function. Our construction is rather nontrivial as it can be seen by comparing 
with already existing explicit examples of Lyapunov functions (see [Zj). All this analysis 
bring us to some conclusions presented in Section [HI This section contains decomposition 
into groups (clusters) in the case of cascade model with any number of processors N and 
our main Conjecture HB1 We show that the proof of this conjecture could be related with 
progress in explicit construction of multi-dimensional Foster-Lyapunov functions. Analysis 
of random walks in Z™ (which was done in 0) shows that, in general, this technical problem 
may be very difficult. In the next papers we hope to overcome these difficulties by using 
specific features of our concrete Markov processes. 

Acknowledgements. The first author is very grateful to the team TRIO (INRIA- 
Lorraine) and to l'Ecole des Mines de Nancy for their hospitality during his stay at Nancy 
in summer 2004 when the main results of this paper were obtained. 

2 Description of continuous time model 
2.1 General model 

We present here some mathematical model for parallel computations. There are N com- 
puting units (processors) working together on some common task. The state of a processor 
k is described by an integer variable G Z which is called a local (or inner) time of the 
processor k and has a meaning of amount of job done by the processor k up to the given 
time moment. 

Assume that the state (x\, x%, . . . ,x^) of the system evolves in continuous time t G R + . 
Any change of a state is possible only at some special random time instants. Namely, 
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with any processor k we associate a Poissonian flow Il fc = {0 = <7q < of < ■ ■ ■ < a* < • • •} 
with intensity A& and with a pair (k, I) of processors we associate a Poissonian flow 
Il kl = {0 = a™ < a kl < ■ ■ ■ < a kl < ■ ■ •} with intensity f3 k i ■ This means, for example, that 
{a k — Cn-i}^=i i s a sequence of independent exponentially distributed random variables 
with mean A^ 1 : Vn = 1,2, . . . P {a k — o k n _ x > s} = exp (— A^s), and similarly for the 
flows We also assume that all these flows n fc and U kl are mutually independent. 

Let us now define a stochastic process (X(t) = (xi(t), . . . ,xisr(t)) ,t G R+) on the state 
space Z N according to the following rules. 

1) At time instants a k the processor k increases its local time x k by 1: Xki&n + 0) = 

2) There is an exchange of information between different processors. At time instant af 
the processor k sends a message to the processor /. We assume that messages reach 
their destination immediately. A message coming to node / from node k contains an 
information about local time x^a 1 ? 1 ) = x^ of the sender k. If at the time instant of' (when 
the message arrives to the node /) we have xi(af l ) > Xk{of) then the local time x\ 
rolls back to the value x^. xiipf + 0) = Xk(crf l ). Moreover, if the processor / rolls back, 
then all messages sent by the processor / during the time interval X = (0i(xk,cr^ l ),af l ), 
where 9i(x,u) := max{s < u : Xi(s) = x, xi(s + 0) = x + 1} ,crf l ), should be eliminated. 
This may generate a cascading rollback of local times for some subset of processors. For 

(x') 

example, assume that there is a processor q which received a message m lq l at some time 
instant s'sl and x q (af l ) > xi(s') = x[. Then the local clock of q should be rolled back 
to the value xi(s'): x q {pf + 0) = xi(s') and, moreover, all messages sent by q during the 
interval X = (6 q (xi(s'), erf), a kl ) should be deleted, and so on. Hence, at time instant a\ l a 
message from k to / can provoke a multiple rollback of processor l,q, . . . in the system. 

2.2 Cascade model 

From now we shall consider the following special subclass of the above general model. 

A chain of processors 1,2, ... ,N is called a cascade if any processor j can send a mes- 
sage only to its right neighbour j + 1. Hence, the processor does not send any message 
and the processor 1 does not receive any message. In other words, ^ <^ (j = i + A 
message sent from j to j+1 can provoke a cascading roll-back of processors j+2, . . . . Recall 
that all above time intervals are exponentially distributed and assumed to be independent. 
Obviously, the stochastic process X^ N \t) = (a;! (£),..., xtv(£)) is Markovian. A very im- 
portant property is that any "truncated" marginal process Xc Nl \t) = (xi(t), . . . , x^it) ), 



4 



Ni < N, is also Markovian. 

Assume that for any j the following limit 

x(t) 

v* = lim — — (in probability) (1) 

J t-*+oo t 

exists. Then the numbers v*, j = 1,...,N, characterize performance of the model. The 
main goal of the present paper is to prove the existence of these limits and to calculate 
them. 

Note that if we uniformly transform the absolute time scale t = cs, where c > is a 
constant and s is a new absolute time scale, the performance characteristics JH) will not 
change. 



3 Definition of the discrete time cascade model 

Consider a sequence 

o = T < n < t 2 <-••<- ■ 

of time moments when changes of local time at nodes may happen (we mean local time 
updates and moments of sending of messages). It is clear that {r r+ i — r r }^° =0 is a sequence 
of independent identically distributed r.v. having exponential distribution with parameter 

N N-l 

i=i i=i 

Observing the continuous time Markov process (xx(t), . . . , xjy(t)) at epochs r n we get the so- 
called embedded discrete time Markov chain {X{n),n = 0, 1, . . .} with state space Z^. In 
the sequel we will be interested in the long-time behaviour of the chain {X(n), n — 0, 1, . . .}. 

Transition probabilities. In the MC {X(n),n = 0,1, .. .} there are transitions pro- 
duced by the free dynamics and transitions generated by rollbacks. By the free dynamics 
we mean updating of local times 

P {X(n + 1) = x + ej | X(n) = x} = XjZ" 1 , j = 1, . . . , N , 



5 



where ej = (0, . . . , 0, 1 , 0, . . . , 0). It is easy to see that if a state x — (x\, . . . , xn) is such 

3 

that for some j Xj < Xj + i then a message sent from j to j + 1 produces a transition of 
the following form 

(Xl, . . . , Xj, X j+ i, . . . , Xi, SCj+i, . . . , X N ) — > (Xi, . . . , Xj, W j+ i, ...,Wl, Xl+i, . . . , x N ) (2) 

with probability 

Z-%, j+ i II P(w q ,x q ,w q+1 ,x q+1 ) x (1 - &z) -(, !> , i+1 -i)-. !+1 ^ (3) 

where 

• sequence (u>j+i, . . . , itfz) is admissible in the following sense: 

j <1<N, w j+1 = Xj w q < w q+l < min (x q , x q+1 - 1) , {j < q < I) 



p(lV q , X q ] W q+ l, X q+1 ) =b q (l- b q 



+ ii -^q+ij "q V x "ql 

A„ 



Here 6 g is the probability of an event that processor q in state x 9 sends at least one message 
to q + 1 before updating its state x g — > x q + 1. For q = N we put 6^ = 0. So in the case 
/ = N the probability (JHJ takes the form 

AT-l 

Relative coordinates. Note that the first processor Xi(t) evolves independently of other 
processors. It is useful to introduce new process Y c (t) = (y 2 (t), . . . ,yN(t)) G Z N ^ in 
relative coordinates as viewing by an observer sitting at the point Xi(t): 

V j{t) :=Xj{t)-x x {t), j = 2,...,N. 
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In a similar way we define Y{n) = F c (r n ), n = 0, 1, . . . . The free dynamics produce the 
following transitions of Y(n): 

P{Y(n+l)=y + e j \Y(n)=y} = \ 3 Z~\ j = 2,...,N, (4) 
p{Y(n + l)=y-Y$Ue j \Y(n)=y} = XiZ~\ (5) 

Since rollback does not affect on the first processor the corresponding transitions have the 
same form and the same probabilities as J2J) and (jHJ). 



4 Stochastic monotonicity 



All statements of this section are valid for the both Markov processes Xc (t), t G R+, 
and X(n), n G Z+. For the sake of breavity we give here results only for the contionuous 
time model Xd (t). The following results will play a significant part in the proof of the 
Theorem |U in Section HI 

Theorem 1. Let us consider two cascade models (say X^(t) andX^it) ) with processors 
1, 2, . . . , n and parameters Ai, . . . , A n and (3^ 2 , (3^ , . . . , (3^1i n for the first model ™ (i) 
and parameters Ai, . . . , A n and (3^ 2 , (3^ , . . . , n for the second model xj, n 2 \t). Assume 
that 

Then X^ is stochastically larger than Xj^ , that is: if'X^(0) = X^(0) thenX^(t) > st 
X$(t)for anyt. 1 

Proof may be given by an explicit coupling construction of the processes X^(t) and 
X^(t) on the same probability space. The following fact should be used: a Poisson flow 
with intensity (3^ can be obtained from a Poisson flow with intensity (3^ in which any 
point (independently from other) is killed with probability 1 — (3± 2 j P\ 2 ' . 

Corollary 2 (Solid barriers). Fix some 1 < r\ < r 2 < • • • < r& < n and consider two 
cascade models: X^(t) with parameters y Ai, . . . , A n ; (3$ , /3<x$ , • • • , ^n-i,n J an d xf?$(t) 

J It means that there exists a coupling (x[ n \t, w), X^{t, w)J of stochastic processes x[ n \t) and X^\t) 

such that p|cj : x[ n \t, w) > X^ft, uj) Vtj = 1. If w, z € R" we say w > z if Wi > Z% for all i = 
1, . . . , n (partial order). 
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with parameters ^ Ai, . . . , A n ; (3^, , ■ ■ ■ , Pn-i,n ) > where 

/fii = /$+i W?{n,...,r b }, (3^ = Vze{r 1; ...,r 6 }. 

We can say that the model xj$ (t) differs from the model (t) by the presence of b solid 
barriers between processors r\ and ri + 1, . . . , r& and r& + 1. Then by Theorem^ we have 
that 



5 Case N = 2 

(2) 

We start with the Markov chain Xc (t). Since processor 1 works independently, it is 

(2) 

enough to consider the Markov chain F c (t) = x 2 (t) — Xi(t). 

Bearing in mind the remark at the end of Subsection for brevity of notation let 
us rescale absolute time in such a way that Z — 1. Then the Markov chain Y{n) has the 
following transition probabilities 

Pi,i+i = A 2 , p*,i-i = Ai, Pi )Q = (3 12 (i > 0), = /3i 2 (i < 0) 

and = for any another pair i,j . 




Theorem 3. If X\ < A 2 then the Markov chain Y{n) is ergodic and we have v* — v\ — \\. 
If X\ > A 2 then the Markov chain Y{n) is transient and we have v\ = Ai, f 2 = A 2 . 

Proof. The Markov chain Y(n) is one-dimensional and its analysis is quite easy. To es- 
tablish ergodicity under assumption Ai < A 2 we use the Foster-Lyapunov criterion (Theo- 
rem fT6l see Appendix) with test function f{y) — \y\, y e Z. This implies that x 2 (t) —xx(t) 
has a limit in distribution as t — > oo. Recall that x\{t) is a Poissonian process hence 



8 



the limit v\ = t 1 liinxi(t) = Ai exists (in probability). It follows from this that v* 2 = 

t~ l livaxoit) = Ai. 

t 

Under assumption Ai > A2 we get transience by choosing the function f(y) = mm(e 5y , 1), 
y G Z, where we fix sufficiently small 5 > 0, and applying Theorem IT7I from Appendix. 
Therefore any trajectory of Y(n) spends a finite time in any prefixed domain {y > C} 
entailing lim t ^oo x 2 (t) — x\{t) = —00 (a.s.). It means that after some time, the messages 
from 1 to 2 can not produce a rollback anymore, so Xi(t) and X2(t) become asymptotically 
independent and hence v* 2 = £~ lim2r 2 (t) = A 2 . 



Theorem 4. Four situations are possible. 

1. If Ai < min (A2, A3) then v * = v\ = v$ = \±. 

2. If A 2 > Ai > A 3 then v{ = v\ = X u v% = A 3 . 

3. If A 2 < min (Ai, A 3 ) then v{ = X 1; v% = v% = A 2 . 
4- If Ai > A 2 > A 3 then v{ = A 1? v% = A 2 , v% = A 3 . 

Items 2, 3 and 4 can be reduced in some sense to the results of the case N = 2 (see 
Theorem EJ. We prove them in the current section. Proof of the item 1 is much more 
intricate and relies heavily on the construction of an adequate Lyapunov function needing 
lengthy developments deferred to the following section 

Proof of Theorem |4] (items 2—4). We start from the item 2: A 2 > Ai > A 3 . Since 
the first two processors are governed by the Markov chain X^\t) and do not depend on 
the state of processor 3 we apply Theorem El and conclude that Xc (t) is ergodic and 

Let us compare the following two cascade models 



□ 



6 Case N = 3 



1 



Pl,2 _ 02,3 




m : 
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2 



o 



3 
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(parameters Ai, A2 and A3 are the same for the both models X c (t) and X c 2 (t) ). 

In the model the groups of processors {1,2} and {3} evolve independently. Ev- 
idently, an asymptotic speed of processor 3 in the model X^ exists and is equal to A3. 
By Corollary 01 X c (t) < st X^if)- Hence in the model Xc an asymptotic speed of the 
processor 3 is not greater than A3. Since A3 < Ai we conclude that there exists some time 
moment T such that for t > T in the model X^ messages from 2 to 3 that roll back 
the processor 3 will be very "rare". So these rare rollbacks will be not essential for an 
asymptotical speed of the processor 3. In other words, as t — > 00 the groups of processors 
{1,2} and {3} of the model X c become asymptotically independent, so the processor 3 
will move with the average speed A3. 

Items 3 and 4 can be considered in a similar way. Note the item 3 consists of two 
subcases: Ai > A 3 > A 2 and A 3 > Ai > A 2 . We omit details. 



7 Explicit construction of Lyapunov function 

In this section we prove the item 1 of Theorem HJ Recall that our key assumption here is 

Ai < A 2 , Ai < A 3 . (6) 

The main idea is to prove that the Markov chain Y{n) is ergodic. To do this we apply the 
Foster-Lyapunov criterion (see Theorem fT6l in Appendix). As in the case of Theorem El 
ergodicity of Y{n) implies that v * = Ai, j = 1, 2, 3 . 

7.1 Transition probabilities 

Consider the embedded Markov chain Y{n). A stochastic dynamics produced by this 
Markov chain consists of two components: transitions generated by the free dynamics and 
transitions generated by roll-backs. For each transition probability a 7^ (3, we have 
the following representation: 

Paf3 = s a p + r af3 , (7) 

where s a p > corresponds to a transition a — > [3 which occurs due to the free dynamics 
and r a/ 3 corresponds to a roll-back transition a — > (3. 

Taking into account the remark at the end of Subsection I2.2| without loss of generality 
we assume that the time is rescaled in such way that Z = 1. This slightly simplifies 
notation for transition probabilities. For example, free dynamics transitions (j3J)-(jEJ) are 
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equal to A2, A3 and Ai correspondingly. On the next figure we show all non-zero transitions 
a — > /3, (a 7^ (3). It is true, of course, that p aa = 1 — X^ Q Pa/?, but it is useless to put 
this information on the picture. Below we give the explicit form of rollback transition 
probabilities: 



1 -> 2 
2^3 



yz 



yz 



yz 



P12 for < y 2 
P23 for y 2 < y 3 

fail -b 2 y 3 h, 
Pn a -h) Z3 b 2 , 

f3 12 (i-b 2 r +i ., 



Z 3 < V3 

z 3 = y 3 
z 3 = yz 



for < y 3 < y 2 
for < y 2 < y 3 



where y = (y 2 ,y 3 ), z = (z 2 ,z 3 ) 
A3 



2^3 
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7.2 Contour of level 1 



In the plane Oy 2 y 3 consider the ellipse 



e(y 2 , ys) = ayl + b (y 2 - y 3 f = 1, 



a > 0,6 > 0, 



and draw a tangent line to it with normal vector (—A, 1). Evidently, there exist two 
tangent lines with the same normal vector (—A, 1). If A > is sufficiently large then one 
of this tangent line touches the ellipse at some point T 3 of the domain y 2 < 0, y 3 < 0. 
Take a segment on this line from the point T 3 to a point K 3 = (0,^3) of intersection with 
coordinate axis Oy 3 . Now let us draw tangent lines to the ellipse corresponding to a normal 
vector (1, —A). If A > is sufficiently large, then one of these lines touches the ellipse at 
some point T 2 of the domain y 3 < 0. Let us take this tangent line and fix a segment on 
it from the point T 2 to a point K 2 = («2, 0) of intersection with coordinate axis Oy 2 - It is 
evident that [K 2 K 3 ] = K 2 + n {(1/2,1/3) : 2/2/V2 + W M 3 = !}■ 

Let us consider now a closed contour L, consisting of subsequently joined segment 
K 3 K 2 , segment K 2 T 2 , arc T 2 T 3 of the ellipse and segment T 3 K 3 . This contour has the 
following property: any ray of the form{ct> , c > 0}, where v G R 2 , v ^ 0, has exactly one 
common point with the contour L. 
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We denote by n(y) the outer normal unitary vector of the contour L corresponding to 
the point y G L, n(y) is well defined at all points of L except the points K 2 and K 3 and, 
moreover, this function is continuous on L except the points K 2 and K 3 . The behaviour 
of n(y) on the arc T 2 T 3 is of prime interest: 

n(y) = || Ve ^ || . Ve(y) = 2( ay 2 + b(y 2 - y 3 ), -b{y 2 -y 3 )), y G T 2 T 3 C L . 
It is easy to see that n(y) = n(T 2 ) for y G (K 2 T 2 \, n(y) = n(T 3 ) for y e \T 3 K 3 ) and 

n(y) = (u 2 \u 3 l ) ye(K 3 K 2 ). 

For the sequel it is important to point out the following points of the arc T 2 T 3 : y^ = 
(-a' 1 / 2 , -a' 1 ' 2 ) and y^ , {y^} = T 2 T 3 n |^ 2) = ^yf}. It is easy to check that 

n{y^) || Oy 3 , n(y^)\\Oy 2 . 

Obviously, both points belong to the domain {y 2 < 0, y 3 < 0}. 
Lemma 5. 

The function n(y) has the following properties: 

• (n(y),y)^0VyeL\{K 2 ,K 3 } 

• If A > is sufficiently large then there exist continuous functions c 2 (y) and c 3 (y) 
such that 

c 2 (T 2 ) = c 3 (T 3 ) = 1, C2 (T 3 ) = c 3 (T 2 ) =0, c 2 (y) > 0, c 3 (y) > y E (T 2 ,T 3 ) 
n(y) = c 2 (y)n(T 2 ) + c 3 (y)n(T 3 ), y e (T 2 ,T 3 ). 

• (n(y), (0,-1)) <0ify 2 <0,y 3 > y 2 , and (n(y), (-1, 0)) < if y 2 > 0, y 3 < 0. 

7.3 Definition of function <p 

For any point (y 2 ,y 3 ) G R 2 \{0} define (p(y 2 ,y 3 ) > such that 

(3/2,2/3) 



<p(y2,y 3 ) 



G L. 
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For (1/2,1/3) = we put f(0,0) = 0. The function ^(2/2, 2/3) is well-defined and has the 
following properties: 

• ip : R 2 — > R_|_ (positivity) 

• <p(ry 2 ,ry 3 ) = np(y 2 ,y 3 ), r > 0, (homogeneity) 

• £ = {?/ : = !}■ 

To any point y = (2/2? 2/3) we put in correspondance a point y* := G L. Therefore, 
<p(y*) = 1. 



Lemma 6. 



The gradient V</?(y) exists at all points except that y for which y* = K 2 or K 3 and, 
moreover, the gradient is constant on rays of the form {cv, c > 0}, v G R 2 : 

ww- n(!/,) 



(y*,n(y*)) 
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• Let y = (2/2, 2/3) be such that y* G T 2 T 3 . Then 



\<p(w)-{V<p(y*),w)\ < 



const 




(8) 



<p(y) 



In other words, in a neighbourhood of the point y the function (p can be approximated 
by the linear function (V<p(y*), ■). 

In particular, (p(y) = (V<p(y*),y). 

Proof of Lemma El is a straightforward computation. 

7.4 Modification of the principle of local linearity 

For any state a define a set T a = {[3 : p a p > 0}. Recall decomposition and define 
F a = {P : s a/ 3 > 0} and R a = {/3 : r Q/ g > 0}. It is evident that T a = F a U R a . The most 
simple case is F a U R a = 0. The case F a U R a 7^ can be reduced to the previous one 
by a dilatation of the state space. Thus we assume that F a U R a = and consider the 
events {Y(n + 1) 6 F a } and {Y(n + 1) G On the set {a> G f2 : F(n, cj) = a} we have 

V(«+i)e^}M +V(n+i)efla}( w ) = L Hence, 



E (/(y(n + 1)) I K(n) = a) - f(a) = E ((f(Y(n + 1)) - f{y)) I {Y{n+ i)eF a} I r(n) = a) + 



It follows from definition of the Markov chain Y{n) (see Subsect. l7.ll) that the diameters 
d a := diamF a are uniformly bounded in a: d = max a d a < +00. Define a vector 



This is an analogue of a notion of mean jump (see (|T3l) in Appendix). In the next subsection 
we shall need the following modification of the principle of local linearity from |7| (see also 
Suhsect. lA.2l in Appendix). 

Lemma 7. Assume that the following condition holds 



+E ({j(Y(n + 1)) - f(y)) / { y(„ +1)€a> , | Yin) = a 



) 



M F (a) = E ({Y(n + 1) - a) I{ Y (n+i)eF a } \ Y(n) = a) = ^ {(5 - a)p a(3 . 



inf 



sup 

oeR n ,||a-a||<ci, 



\f{a)-l(a)\ < e 
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where inf is taken over all linear functions I. If 

f{a + M F {a))-f{a) < -he , 

then the following inequality 

E ( ( f(Y(n + 1)) - f(a) ) I {Y{ n+i)eF a } I Y(n) = a) < -s 

holds. 

The proof of this statement repeats the proof of principle of local linearity presented 
in and is omitted. 

7.5 Proof of the Foster condition 

The validity of the Foster condition will follow from several ancillary lemmas dealing with 
the following different domains of the state space: 

E- := {y = (y 2 , ys) ■ min(y 2 , Vz) < 0} , 

Ei := {y = (y 2 , y 3 ) ■ V2 > 0, y 3 > 0} , 

Ei,2 ■= {y2 >0,y 3 = 0}, 

E lj3 := {y 2 = 0,y 3 > 0}. 

Lemma 8. Consider the domain £L = {y = (y 2 , y 3 ) : mm(y 2 , y 3 ) < 0}. There exists G^> 
0, such that if <p(y) > Cj^J, then 
a) 

E ({y{Y{n + 1)) - <p(y)) I { Y( n +i)e^} I Y(n) = y) < , 
b) there exists e > such that 

v{y + M F (y))-^(y) < -he. 

Proof. It is evident that the vector 

M F {y) = (A a - A 1; A 3 - A x ) 

is constant (does not depend on y). Since the vector n(T 2 ) is co-directed with the vector 
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(1, —A) and the vector n(T 3 ) is co-directed with the vector (—A, 1) and the conditions 
A 2 > Ai, A 3 > Ai hold, we can find a large A x > such that 



(M F (y),n{T 2 )) < 0, (M F (y),n{T 3 )} < 0, VA > A : . 

Fix this Ai. Hence, by Lemma El there exists e > such that 

(M F (y),n(y)) < -6e if mm(y 2 ,y 3 ) < 0. 

Put w = y + M F (y) and consider 

(p(w)-<p(y) = <p(w) - (V<p(y*),y) 

= <p(w) - (V(p(y*),w) + (V(p(y*),w -y) 
= <p{w) - (V<p(y*),w) + (V<p{y*), M F {y)) . 

By ((HI) for any given £>0we can choose Co > such that 

\<p{y + M F (y)) - (Vip(y*), M F {y))\ <^-d 2 < £ if <p{y) > C . 

Now the item b) of the lemma easily follows. 

Let us prove the item a) of the lemma. Note that in the domain y 3 > y 2 , y 2 < a 
rollback decreases coordinate y 3 : (y 2 ,y 3 ) — > (y' 2 ,y 3 ) = (2/2? 2/2)- From geometrical properties 
of level sets of function (p and item 3 of Lemma El it follows that any transition generated 
by a rollback decreases a value of the function <p: </?( (y 2 , y 3 ) ) < ip( (y' 2 , y' 3 ) ). In the domain 
y 3 < 0, y 2 > a rollback has the following form: (y 2 ,y 3 ) — > (y' 2 ,y 3 ) = (0,2/3). For similar 
reasons we again have <p((y 2 ,y 3 )) < (p( (y' 2 , y' 3 ) ). In the domain y 3 < y 2 < there is no 
rollback. Now the item a) easily follows. □ 

Lemma 9. Consider the domain: E\ = {y — (y 2 ,y 3 ) : y 2 > 0,y 3 > 0}. 
1. The conditional expectation 

E ((<p(Y(n + 1)) - <p(y)) I { Y(n+i)eF y} I Y(n) = y) = ( (u 2 \ nj 1 ) , M F {y)) 
does not depend on y. 
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2. There exist constants Qfy ^ > such that 

E ((<p(Y(n + 1)) - <p(y)) I { Y in+ i)eR y } I Y(n) = y) < - W (y) if y{y) > Cfa (9) 

Proof. The first statement follows from the fact that in this domain <p(y) = ( (u 2 , u 3 x ) , y) , 
and the vector M F (y) does not depend on y. 
Let us prove ©. Fix some level set 

L+ := {y : (p(y) = C}r\E l = {y 2 /u 2 + y 3 /u 3 = C,y 2 > 0, y 3 > 0} 

and consider an action of rollbacks for y G Lq. We have three different situations. 

C 

a) Let y be such that y 2 > 2/3 > 0. It follows that 1/2 > -3 — ■ As ^ can ^ e eas ily 

concluded from Subsection I7.1| with probability f3i 2 we have a rollback of the following 
form (2/2,1/3) — ► (0,1/3) where < y' 3 < y 3 . Then we obtain 

<p((p,y' 3 ))-<p((y2,y3)) = 






, 2/3 




! 2/3 


u 2 


"3, 







< -— < J J («2 



«2 - + - 1 + « 2 /«3 

uniformly in y 3 such that y 3 < y 3 . To phrase it, we will say that with probability fii 2 the 

C 

increment of <p(y) is less or equal to ; — . Hence the conditional mean 

1 + u 2 /u 3 

E ((cp(Y(n + 1)) - <p{y)) I { Y {n+ i)eR y } I Y(n) = y) (10) 



P12C 
1 + u 2 /u 3 



does not exceed the value — - — ; — if y G y 2 > y 3 > 0. 



b) Let y G be such that < \y 3 < y 2 < y 3 . It follows that y 2 > 



c 



2 



With probability /3 12 we have a rollback {y 2l y 3 ) — > (0, y 3 ) where < y' 3 < y 3 and with 
probability /3 2 3 we have a rollback (j/ 2 , 2/3) — > (2/2, 2/2)- Both of them give negative increments 

of the function <p. But the first rollback gives the increment <p ((0, y' 3 )) — <p ((y 2 , y 3 )) which 

C _ 

is less or equal to ; ; — -. So we conclude that the above conditional mean (1101) 

2(1 + u 2 /u 3 ) 

will not exceed the value — (1 + u-i/u^ 1 C. 
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c) Now let y G be such that < y 2 < \y 3 . It follows that y 3 — y 2 > K(C) where 

1 C Cu 3 



K{C) :-- 



2 , j_\ u 3 /u 2 + 2 

\2u 2 ^ u 3 J 



With probability j3u we have a rollback (2/2,1/3) — > (0,2/3), < 2/3 < 2/3, an d with probabil- 
ity P23 we have a rollback (2/2,2/3) — *• (2/2,2/2)- The first rollback gives a negative increment 
of the function if, and the second rollback gives the increment y 9 ((2/2, 2/2)) — ^((2/2,2/3)) 
which is less or equal to — K{C) /u 3 . Hence the conditional expectation (|TTHl does not 
exceed the value —(3 2 zK(C)/u 3 = —/3 23 {u 3 /u 2 + 2) _1 C. 
The proof of the lemma is completed. 

□ 

Lemma 10. Consider the cases when y belongs to the axes: y G E\ y3 , y G E 2y3 . Here 
E ((<f(Y(n + 1)) - <p(y)) I {Y (n+i)eR y } I Y(n) = y) < -Ttffif>{y) , 

and 

E ({<p{Y{n + 1)) - y?(i/)) I{Y(n+i)eF y} \ Y(n) = y) 
does not depend on y, where y G {y : <p(y) > QjJ]^- 

Proof. We consider in details the case E 1>3 = {y 2 = 0, y 3 > 0}. We start with the free 
dynamics. The following transition 

(0, 2/3) -> (y 2 , y'z) = (-1, 2/3 - 1) e E 3 = {2/2 < 0, 2/3 > 2/2}- 

occurs with probability Ai. It is easy to see that for y 3 > Ofctf\ values of the function ip(-) 
in both points (0, y 3 ) and (— 1, y 3 — 1) concide with the values of linear function (n(T 3 ), ■). 
With probability A 2 we have a transition 

(0, 2/3) - (2/2, 2/3) = (1, 2/3) e £1 = {1/2 > 0, y 3 > 0}, 
and with probability A3 we have a transition 

(0,y 3 ) ^ (y' 2 ,y' 3 ) = (0,y 3 + l) e E 3 . 
Evidently, that in (0, y 3 ), (1, y 3 ) and (0, y 3 + 1) the values of <p(-) coincide with the values 
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of a linear function (ni, •). Hence, 



E ((tp(Y(n + 1)) - <p(y)) I{Y(n+i)eF y} I Y(n) = y) = Xi (n(T 3 ), (-1, -1)) 

+ A 2 (m, (1,0)) + Ag (m,(0,l)>. 

Since the r.h.s. does not depend on y we get the second statement of the lemma. 

Due to a rollback the Markov chain Y(n) goes from the state (0,1/3) G £1,3 to a state 
(0,0) with probability ^23 . Note that values of ip(-) at these two points can be calculated 
by using the linear function (n 1; •). Obviously, that the increment of tp corresponding to 
this rollback is equal to ip ( (0, 0) ) — (p ( (0, y 3 ) ) = — C, where C = (p( (0, y 3 ) ). The first 
statement of the lemma is proved. 

The case of the domain Ei >3 = {y 2 > 0, y 3 = 0} is similar. 

□ 

Lemma 11. For any C > 

sup E (ip(Y(n + 1)) I Y(n) = y) < +00 . 
y- f{y)<Co 

Proof. This statement follows from the fact that the jumps of any fixed neighbourhood of 
(0, 0) are bounded and the fact that the function ip is continuous. □ 

In view of the Lemmas I7HTT| the Foster-Lyapunov criterion (Theorem fTH| Appendix) is 
fulfilled with f(y) = ip(y) therefore the Markov chain Y{n) is ergodic and hence the proof 
of the item 1 of Theorem 0] is completed. 

8 Conclusions, conjectures and perspectives 
8.1 Decomposition into groups 

We shall always assume that all Ai, . . . , Xn are different. Define a function 

£(m) := min Aj . 

i<m 
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Evidently, this function has the following property: 



^(1) = Ai > ■ ■ • > £{m) > £{m + 1) > • • • > min(Ai, . . . , X N ) = £{N) . 

Level sets of function £ generate a partition of the set {1, . . . , N}. Namely, there exists 
a sequence jx — 1 < < 4 4 4 < 3k < jx+i = N + 1 such that the set of all processors can 
be divided into several nonintersecting groups 

{1,...,JV}= |J G k , (11) 

k=Tj< 



Gk ■= {jkJk + l, • • .Jk+i - 1} , £{jk - i) > £{jk) = ■ ■ • = £(jk+i - 1) > A 



Jk+l ■ 



Remark 12. An equivalent description of the group is possible. We say, for example, that 
1, 2, . . . , k is a group if 

Ai < min(A 2 ,...,Ajfe), Ai > A fc+ i. (12) 



8.2 Long-time behaviour of the groups 

Taking into account Theorems El and |U and the above notion of groups of processors we 
put forward the following Conjecture. 

Conjecture 13. Assume that all Aj, . . . , Ajv are different. For any j the following limit 
v* = lim — - exists and v* = £(j) . 

Therefore this conjecture entails v * = £(jk) for j G Gk- If for some k the group Gk 
consists of more than one processor we may say that the processors of the group Gk are 
synchronized. 

Remark 14 (On monotone cases). If Ai < • ■ ■ < Xn then v* = X± for any j. 

If Ax > • • • > Ajv then for all j we have v* = Xj. 

Let us discuss briefly perspectives of rigorous proof of the above Conjecture for large 
values of N. In fact, we have already proved this conjectures for a wide class of cascade 
models. 
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Theorem 15. Assume that all Ai, . . . , Ajv are different and a partition of the set of 

x(t) 

processors {1, . . . , iV} is such that \Gk\ < 3 for all k. Then the limits v* = lim — — exist 

J t^+oo t 

and v* = £(j) . 

The proof of this statement is just a combination of the result of Theorem HI (item 1) 
and arguments of the proof of items 2-4 of Theorem 0] We will not pursue further. 

So the key to the proof of Conjecture consists in generalization of item 1 of Theorem B] 
As it was seen in Section a possible way of such generalization is an explicit construction 
of Foster-Lyapunov function in high dimensions. This seems to be a difficult technical 
problem which is out of scope of this paper. 

A Appendix 

Let (£„, n = 0, 1, . . .) be a countable irreducible aperiodic Markov chain with the state 
space A. 

A.l Criteria 

We use the following Foster criterion. 

Theorem 16 (|7J). The Markov chain £ n is ergodic if and only if there exists a positive 
function f(a), a G A, a number e > and a finite set A e A such that 

1) 

E(/(£«+i) I & = !/)- /(!/)<-£ 

for all y ^ A, 

2) E {f(tn+i) I Cn = y)<+oo for all y e A. 

The following theorem give a criterion of transience. 

Theorem 17 (|7J). The Markov chain is transient, if and only if there exists a positive 
function f(a) and a set A such that the following inequalities are fulfilled 

E(/(£ m+1 )|£ m = aO-/K)<0, VaigA 
f(ctk) < hif f( a j)j f or a t least one a& $ A. 
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A. 2 Principle of local linearity 

From now on we assume that the state space A is some subset of R fc . Define a vector of 
mean jump from the point a 

M(a) = E ( (£ n+ i - a )\£ n = a) = - a)p aP . (13) 

P 

Assume that d a := max^ {\/3 — a\ : p a p > 0} < oo for all a. 
The following principle of local linearity was proved in 0. 

Lemma 18. Assume that at some point a the following condition 

inf sup \f(de) — l(a)\ < e, 

1 aeR n ,\\a-a\\<d a 

holds, where inf is taken over all linear functions I. Then 

f(a + M(a)) - f(a) < -5s =^ E ( /(£ n+1 ) - f\a) \£ n = cc)<-e. 
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